A systematic comparison of statistical methods to detect interactions in exposome-health associations

Authors

  • Jose Barrera-Gómez
  • Lydiane Agier
  • Lützen Portengen
  • Marc Chadeau-Hyam
  • Lise Giorgis-Allemand
  • Valérie Siroux
  • Oliver Robinson
  • Jelle Vlaanderen
  • Juan R González
  • Mark Nieuwenhuijsen
  • Paolo Vineis
  • Martine Vrijheid
  • Roel Vermeulen
  • Rémy Slama
  • Xavier Basagaña
Abstract

BACKGROUND There is growing interest in examining the simultaneous effects of multiple exposures and, more generally, the effects of mixtures of exposures, as part of the exposome concept (defined as the totality of human environmental exposures from conception onwards). Uncovering such combined effects is challenging owing to the large number of exposures, several of which are highly correlated. We performed a simulation study in an exposome context to compare the performance of several statistical methods that have been proposed to detect statistical interactions.
METHODS Simulations were based on an exposome including 237 exposures with a realistic correlation structure. We considered several regression-based statistical methods: the two-step Environment-Wide Association Study (EWAS2), the Deletion/Substitution/Addition (DSA) algorithm, the Least Absolute Shrinkage and Selection Operator (LASSO), Group-Lasso INTERaction-NET (GLINTERNET), a three-step method based on regression trees, and Boosted Regression Trees (BRT). We assessed the performance of each method in terms of model size, predictive ability, sensitivity and false discovery rate.
RESULTS GLINTERNET and DSA had better overall performance than the other methods, with GLINTERNET having better properties in terms of selecting the true predictors (sensitivity) and of predictive ability, while DSA had a lower number of false positives. In terms of ability to capture interaction terms, GLINTERNET and DSA again performed best, with the same trade-off between sensitivity and false discovery proportion. When GLINTERNET and DSA failed to select an exposure truly associated with the outcome, they tended to select a highly correlated one. When interactions were not present in the data, using variable selection methods that allowed for interactions had only slight costs in performance compared to methods that only searched for main effects.
CONCLUSIONS GLINTERNET and DSA provided better performance in detecting two-way interactions, compared to other existing methods.
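
The selection criteria named in the abstract (model size, sensitivity, false discovery proportion) can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `selection_metrics`, the helper `interaction`, and the exposure labels `x1`, `x2`, `x7`, `x9` are hypothetical, and the definitions used are the usual ones (sensitivity as the share of true terms that were selected, false discovery proportion as the share of selected terms that are not true). Averaging the false discovery proportion over simulated data sets gives an estimate of the false discovery rate.

```python
def interaction(a, b):
    """Represent a two-way interaction as an unordered pair (hypothetical helper)."""
    return frozenset((a, b))


def selection_metrics(true_terms, selected_terms):
    """Compare the terms selected by a method with the terms that truly
    generated the outcome in one simulated data set."""
    true_set = set(true_terms)
    selected_set = set(selected_terms)

    true_positives = true_set & selected_set
    false_positives = selected_set - true_set

    # Sensitivity: fraction of the true terms that the method recovered.
    sensitivity = len(true_positives) / len(true_set) if true_set else float("nan")
    # False discovery proportion: fraction of selected terms that are false;
    # taken as 0 by convention when nothing is selected.
    fdp = len(false_positives) / len(selected_set) if selected_set else 0.0

    return {
        "model_size": len(selected_set),
        "sensitivity": sensitivity,
        "false_discovery_proportion": fdp,
    }


# Assumed toy scenario: two main effects and their interaction are true.
true_terms = ["x1", "x2", interaction("x1", "x2")]
# A hypothetical method recovers x1 and the interaction, misses x2,
# and adds two false positives.
selected_terms = ["x1", "x7", interaction("x2", "x1"), "x9"]

metrics = selection_metrics(true_terms, selected_terms)
print(metrics)
```

Storing an interaction as an unordered pair means that `x1:x2` and `x2:x1` count as the same term, so the order in which a method reports the two exposures does not affect the metrics.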


Similar resources

A Systematic Comparison of Linear Regression–Based Statistical Methods to Assess Exposome-Health Associations

BACKGROUND The exposome constitutes a promising framework to improve understanding of the effects of environmental exposures on health by explicitly considering multiple testing and avoiding selective reporting. However, exposome studies are challenged by the simultaneous consideration of many correlated exposures. OBJECTIVES We compared the performances of linear regression-based statistical...


The exposome concept: a challenge and a potential driver for environmental health research.

The exposome concept was defined in 2005 as encompassing all environmental exposures from conception onwards, as a new strategy to evidence environmental disease risk factors. Although very appealing, the exposome concept is challenging in many respects. In terms of assessment, several hundreds of time-varying exposures need to be considered, but increasing the number of exposures assessed shou...


Biomonitoring in the Era of the Exposome

BACKGROUND The term "exposome" was coined in 2005 to underscore the importance of the environment to human health and to bring research efforts in line with those on the human genome. The ability to characterize environmental exposures through biomonitoring is key to exposome research efforts. OBJECTIVES Our objectives were to describe why traditional and nontraditional (exposomic) biomonitor...


Comparison of Strategies to Detect Epistasis from eQTL Data

Genome-wide association studies have been instrumental in identifying genetic variants associated with complex traits such as human disease or gene expression phenotypes. It has been proposed that extending existing analysis methods by considering interactions between pairs of loci may uncover additional genetic effects. However, the large number of possible two-marker tests presents significan...


Development of Exposome Correlations Globes to Map Out Environment-Wide Associations

The environment plays a major role in influencing diseases and health. The phenomenon of environmental exposure is complex and humans are not exposed to one or a handful factors but potentially hundreds factors throughout their lives. The exposome, the totality of exposures encountered from birth, is hypothesized to consist of multiple inter-dependencies, or correlations, between individual exp...



Journal title:

Volume 16, Issue

Pages -

Publication date: 2017